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ABSTRACT 


Introduction: At the College of Languages and Translation (COLT), the students take four speaking 
courses in the first four semesters of college. In each course, they take two interm tests and a final 
exam. 

Aims: The study aims to find out which speaking assessment method EFL students and instructors 
at COLT prefer: Face to face, or lab testing, and reasons for their preferences. 

Procedures: Face-to-face assessment is the most common method at COLT. The students are 
tested individually. Each student randomly draws one topic from a basket and engages ina 
conversation or interview with the teacher who asks 3-5 questions about it. The test session for all 
the students lasts from 8 a.m. to 1 p.m. non-stop. The second method is language lab testing used 
by few instructors where all the students take the test at the same time and they all answer the 
same set of questions in about 60 minutes. The lab test consists of several questions. The 
students read the printed questions, take short notes, then record their answers on an MP3 player. 
Results: Most students prefer lab assessment, whereas most instructors prefer face-to-face 
assessment. Students reported that lab test questions are comprehensive and help them improve 
their speaking skills better than face-to-face assessment. Testing conditions are the same. They 
are less anxious. They lose few marks if they miss a question. On the other hand, instructors 
reported that face-to-face test questions are easy and only cover part of the material. All students 
pass. Students are more anxious. Comparisons of students’ test score showed that lab tests are 
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more reliable, valid and have a better discriminating power between students who have mastered 
and those who have not mastered the speaking skills than face-to-face tests. Recommendations for 
effective speaking assessment methods are given. 


Keywords: Speaking assessment; speaking tests; speaking tasks; speaking in EFL; EFL learners; 


lab testing; face-to-face testing. 
1. INTRODUCTION 


The speaking skill in the second/foreign language 
is difficult for students to develop and challenging 
for teachers to teach and assess because it is a 
productive skill which requires students to be 
unique and creative in the way they express 
themselves orally. A review of the second 
language teaching and learning literature showed 
a plethora of studies that investigated several 
aspects of speaking assessment. The first line of 
research focused on standardized tests such as 
the IELTS and TOEFL [1-5] and on specific- 
purpose speaking tests such as the Occupational 
English Test for assessing the language 
proficiency of health professionals [6-8] and 
others. 


A second line of research focused on variables 
that affect speaking assessment such as the 
effect of task type, and students’ first language 
background on the internal structure of the test 
[9,10]; the impact of raters' language background 
on their judgements of the speaking performance 
in the College English Test-Spoken English Test 
(CET-SET) of China [11]; variability in raters’ 
scoring decisions and their expertise in scoring 
speaking tests [12]; inter-examiner reliability in the 
pronunciation assessment component of 
speaking tests [13]; the criteria English teachers 
associate with general oral proficiency [14]; 
scoring procedures and rating scales such as 
checklists vs rubrics with level descriptors [15]; 
divergent interpretations of the test performance 
by the students and variation in interlocutor 
behaviour [16]; and topical knowledge, anxiety, 
and integrated speaking test performance [17]. 


A third line of research specifically focused on the 
methods utilized in speaking assessment, such as 
using interviews or conversations [16]; Oral 
Proficiency Interviews [18]; Dynamic Assessment 
based on Vygotsky's Zone of Proximal 
Development, in which the students receive a pre- 
test and self-reflection, feedback and knowledge 
expansion, post-test and self- reflection, post- 
feedback, and semi-structured interviews [19]; 
guiding students while listeningto audio files, and 
having students transcribe, 
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analyse, self-assess their own speaking samples, 
and record feedback on their own performance 
[20]; Story Retelling Speaking Tests [21]; Group- 
based speaking tests such as group oral 
discussion [22-24]; rehearsed speech where the 
students memorize scripts for their oral responses 
[25]; pressured vs. unpressured on- line testing 
conditions [1]; paired tasks as in the paired format 
of the Cambridge Speaking Tests [26,27]; 
individual tasks in which the students interact with 
an examiner and another paired task in which 
they interact with other students [28]; text- 
speaking, graph-speaking, and listening-speaking 
tasks [9]; integrated and independent speaking 
tasks [29]; and answering questions, describing a 
picture and giving a presentation in a language 
lab [10]. 


At the College of Languages and Translation 
(COLT), King Saud University, Riyadh, Saudi 
Arabia, 4 speaking courses are offered to 
students in the first four semesters of the 
translation program. In each speaking course, the 
students take 2 speaking interm tests per 
semester and a final exam. The most common 
speaking assessment method is face-to-face 
testing in which the students are tested 
individually, and each student talks about a topic 
in front of the instructor. Another speaking 
assessment method is lab testing, where all the 
students take the test at the same time, andeach 
student answers a set of questions and records 
her answers using an MP3 player. 


Despite the utilization of these speaking 
assessment methods for a long time, there is lack 
of research that focuses on speaking assessment 
for EFL college level students in Saudi Arabia, 
such as speaking assessment methods that 
students and instructors prefer, their advantages 
and shortcomings, the kind of tasks speaking 
tests focus on, content covered by speaking tests, 
how speaking tests are conducted, how they are 
scored, how marks are assigned, validity, 
reliability and discriminating power of the 
speaking tests utilized, and effectiveness of 
different testing procedures. Therefore, the 
present article aims to answer the following 
questions: (i) which speaking 


assessment method EFL female college students 
and instructors prefer: face-to-face or language 
lab testing; (ii) what are the advantages and 
shortcomings of each method as perceived by 
students and instructors; (iii) which method is 
more effective in measuring the speaking skill in 
terms of the types of tasks given on the speaking 
test; (iv) how many tasks the students perform on 
the test; (v) how the students are tested, content 
sampling, breadth of coverage of the test tasks, 
validity, reliability, discrimination power, scoring 
and rater reliability and accuracy. 


Identifying students and instructors speaking 
assessment method preferences will be based on 
surveys with a sample of students and speaking 
instructors. Students and instructors’ preferences 
will be reported quantitatively and qualitatively. 
Furthermore, validity, reliability and discrimination 
power of the speaking tests inboth methods will 
be based on the final exam speaking scores of 
two groups of students in the Speaking III course, 
who were tested using the two-assessment 
methods. 


Results of the current study are useful for EFL 
college speaking instructors. They will enrich their 
understanding of speaking assessment methods, 
effectiveness of the assessment methods, their 
advantages and shortcomings, reliability, validity 
and discrimination power issues, how they are 
scored, speaking test item selection, what they 
measure and which tasks and skills they cover. 
They will give them an idea about the testing and 
grading procedures in each assessment method. 


2. METHODOLOGY 
2.1 Subjects 


The student sample consisted of 69 EFL female 
in the translation program at COLT. They have 
experience with the lab testing method in the 
Speaking III course and the face-to-face speaking 
assessment method in the Speaking | and Il 
courses. In addition to the Speaking courses, 
students at COLT take Listening, Reading, 
Writing, Grammar, and Vocabulary Building 
courses in the first 4 semesters of the translation 
program (20 hours per semester including the 
Speaking course). 


The instructor sample consisted of ten female 
instructors who have taught speaking at COLT. 
They have a Ph.D. or an M.A. degree and are 


40 


Al-Jarf; AJESS, 16(3): 38-50, 2021; Article no.AJESS.67216 


specialized in linguistics or teaching English as a 
foreign language. Six instructors have used the 
face-to-face assessment method, two have used 
the language lab assessment method only, and 
another two have used_ both speaking 
assessment methods. All the instructors have 
used the same speaking textbooks. 


2.2 Materials and Tasks 


The textbooks used for the four speaking courses 
at COLT are: Interactions | & Il and Mosaic | & Il 
from McGraw Hill. The courses aim to develop 
students’ ability to talk about a particular topic for 
at least 5 minutes without preparation, with 
fluency, correct pronunciation, stress, intonation, 
grammar, choice of words and well-organized 
cohesive ideas. For example, in Speaking Ill, the 
students practice the Mosaic | speaking skills 
listed in Table 1. 


In addition, the students practice the language 
functions in Table 2. 


2.3 Speaking Assessment Methods at 
COLT 


2.3.1 Face-to-face method 


In face-to-face assessment at COLT, the test 
consists of an oral part and a written part. In the 
oral part, the students are tested face to face 
individually. The instructor prepares a set of 
questions or topics for students to talk about and 
writes each on a card (See Appendix A). The 
course instructor sits in the classroom while the 
students are outside. She calls on the students 
one by one to go in and take the test. Each 
student randomly draws one topic from a basket 
and engages in an interview or a conversation 
with the instructor during which the instructor asks 
3-5 questions on the topic. The topics differ from 
one student to another (See Appendix A). The 
instructor might also ask the students to describe 
a picture. In Levels Ill and IV, the exam consists 
of one task only in which each student gives a 
presentation about a topic in front of the class. 
She completes the conversation and/or 
presentation in 10 minutes or less. The test 
session for all the students lasts from 8 a.m. to 1 
p.m. non-stop and sometimes it lasts for two 
days if the number of students enrolled in the 
speaking courses is large. The written part is 
administered on a separate day and it is similar to 
a writing test. 
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Table 1. Speaking skills practiced in the mosaic in textbook 


e Presenting facts, feelings 

e Opinions and reasons 

e Discussing funny people 

e Telling jokes 

e Responding to questions 

e Role-playing: hopes and desires 

e Discussing personal wishes/dreams 

e Using expressions to elicit the 'truth' 

e Sharing unusual or surprising events 

e reporting surprising research 

e Role-playing: characters with definite views 

e Presenting rationales in active and passive 
voice 

e Agreeing/disagreeing with a suggestion or 
point of view 


Role-playing: eyewitness reporting 

Using humor in discussions 

Describing a scene using analogies 

Using notes to describe a process 
Conducting interviews using tag questions 
Creating chronological round-robin story 
Paraphrasing short readings 

Telling stories using digressions 
Discussing general information 

Using expressions with riddles andbrain 
teasers 

Discussing various forms of communication 
Using an outline to organize thoughts for 
presentation 

Using tone of voice to express likes and 
dislikes 

Discussing pros and cons of investment 
options 


Table 2. Sample functions practice in speaking III 


Offering and requesting clarification (does 
that make sense? Are you following me?) 


hopes and desires 

introducing information, 

expressing opinions; 

shifting focus, 

expressing wishes, 

seeking confirmation and challenges with 
tag questions 


e asking for confirmation (/ do not exactly know 


what you mean; | am not sure | am getting this) 


e making generalizations 

e likes and dislikes 

e asking for information 

e stating reasons 

e telling a joke 

e agreeing and disagreeing 


Instructors use a rubric for scoring the interm 
tests and another one for scoring the final exam 
(See Table 3 & 4). The scoring rubric consists of 
4 skills: Content and Comprehension, 

(ii) grammar, (iii) fluency and voice control, and 
(iv) pronunciation. A total of 15 marks is allocated 
to each question on the interm test (6 marks are 
allocated to content and comprehension, 3 marks 
are equally allocated to grammar, fluency and 
voice control, and pronunciation) with a total of 75 
marks for the whole Interm test. The students’ 
mark is then converted into a mark out of 15 plus 
a mark out of 10 for the written test. In the final 
exam rubric in Table 4, 10 marks are allocated to 
each question, with a total of 50 marks for the 
whole final exam. For each question, she circles 
a score for each skill that reflects a student’s 
performance (1 for poor, 3 for average and 6 for 
perfect). Scoring rubrics are not shown to the 
students. No comments or feedback on strengths 
and weaknesses are given to the students on their 
performance during the test. 
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2.3.2 Language lab assessment method 


In the language lab assessment method, all the 
students take the same test as a group at the 
same time, but they answer the questions 
individually. They complete 8-10 questions 
covering different themes, skills and language 
functions printed on a piece of paper. The test 
tasks are similar to those performed in class. To 
prevent cheating, two versions of the test 
questions are prepared (same questions, but in a 
different order). The students write their names on 
the question paper. A week in advance, the 
instructor asks the students to bring an MP3 
player, practice using the MP3 player at home, 
and test it at the beginning of the test session. 
Each student sits in her cubicle with her headsets 
on, reads the questions, thinks about them for a 
while, then records her responses to the 
questions in any order provided that she mentions 
the question number at the beginning of the 
answer. To minimize noise, the students keep 
their headsets on and record answers ina 


soft voice. Students are not allowed to write 
down answers to the questions and read them 
into the MP3 player. They are not allowed to re- 
listen to their answers, delete and re-record 
answers. At the end of the test session, the 
instructor collects the MP3 devices and test 
question papers of the students. The test session 
takes about 60 minutes. 


When the students are tested in the language 
lab, the instructor listens to each student’s MP8 at 
her own convenience at home or in the office. 
While listening to the recordings and scoring the 
students’ responses, she writes comments on 
strengths and weaknesses for each student. 


To assign marks, the lab instructor uses a scoring 
rubric as in Table 5 that consists of 3 skills: (i) idea 
generation or content; (ii) grammar and 
vocabulary; (iii) pronunciation and fluency. 50% of 
the mark is allocated to ideas (content); 30% to 
grammar and vocabulary, and 20% _ for 
pronunciation and fluency, i.e., 50% for the 
content (ideas) and 50% for the form (grammar, 
vocabulary, pronunciation, and fluency). For each 
skill category, a continuum that shows marks for 
the excellent, average, and poor performance are 
given in the cells. If the test is comprised of 10 
questions, then 10 marks are allocated to each 
and the total marks is then converted into 25 for 
the Interm test, or 50 for the final. 


3. DATA COLLECTION AND ANALYSIS 


The 69 students responded to a survey with the 
following open-ended questions about their 
preferences: 


1) 
2) 


Which speaking assessment procedure do 
you prefer and Why? 

Compare test anxiety in both assessment 
methods, the mark you got on both, the kind 
of feedback you get from the instructor in both 
methods, the difficulty level of the questions 
in both methods. 

Which assessment method covers more 
speaking material and tasks practiced in 
class? 

How effective is each method in improving 
your speaking skill? 


3) 


4) 


Similarly, speaking instructors in the sample 
responded to a survey with the following open- 
ended questions: 
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1) 
2) 


Which speaking assessment method do you 
prefer: face to face or language lab and why? 
Compare the face to face and lab assessment 
methods in terms of: (i) speaking skills and 
tasks covered by the test; 

(ii) item difficulty level and discrimination 
power in both; (ili) test anxiety in students; 
(iv) time required for administering the test in 
both methods; (v) cheating in both methods; 
(vi) how tests are scored; (vii) scoring time 
and scoring accuracy in both methods; (viii) 
percentage of passing students in both 
methods; and (ix) feedback given to students 
in both methods. 


Students and instructors’ preferences and views 
are reported in the results section quantitatively 
and qualitatively. 


Moreover, to calculate the reliability coefficient 
and discrimination power of tests in the face to 
face and lab assessment methods, the course 
grades for students who have completed the 
speaking Ill course over 2 semesters were 
obtained from the Registration Department at 
KSU. The internal consistency _ reliability 
coefficient, and the percentage of students who 
got A, B, C, D, and F grades for the speaking 
course were computed for 120 students whotook 
the face-to-face test in the Fall semester, and for 
69 students who took the lab test in the spring 
semester. 


4. RESULTS AND DISCUSSION 
4.1 Students’ Views 


85% of the students who had experience with both 
face-to-face and lab assessment, prefer the lab 
assessment method. The students reported that 
taking the exam in the language lab reduces their 
anxiety as they have some time to think about the 
answer, organize their ideas and they are not 
being watched over by the instructor. In face-to- 
face assessment, they feel tense while waiting for 
their turn to take the test. In lab assessment, they 
are given 8-10 questions and the overall test mark 
(50 marks) is divided by 8or 10 questions. If the 
student misses a question, she will lose few 
marks, but in face-to-face assessment she will 
lose more marks as the whole exam consists of 
one or two topics. When they are given 8-10 
questions to answer, different parts of the 
speaking material and skills and tasks are 
covered. But when one topic is 


given face to face, it covers only part of the 
material, skills or tasks practiced in class. The lab 
test questions are usually of different difficulty 
levels; thus, they match the students’ ability 
levels. Teachers can give written feedback to 
students on their strengths and weaknesses and 
areas of improvement because they grade the 
test at home not on the spot as in face-to-face 
assessment. This feedback is beneficial for them 
and enables them to work on their weaknesses. 
The students reported: 


Najat: Because the lab test has many questions 
that cover all types of skills and tasks that 
we have practiced in class, we study harder 
and practice more before we take the test. 


| feel more relaxed when | take the 
speaking test in the lab because | have 
more time to think and organize my ideas. 
| talk to myself and do not have to look at 
the teacher who is looking at me while 
talking face to face. 


Dalal: 


Noura: Because the lab test focuses on many 
details in the course, | now can speak 
fluently using correct grammar and 
pronunciation and can easily generate 
ideas. 


Nadia: The face-to-face test is easy because | talk 
about one topic and answer a couple of 
questions and finish in less than 10 
minutes. Knowing that the test is going to 
be easy, and most students pass, | do not 
work hard during the semester and do not 
practice outside the classroom unless | 
have to prepare like a presentation or a 
debate. 


Hanan: When we take the speaking test in the lab, 
the teacher gives us back the test papers 
and MP3 players. | know what marks | got 
for each question, on which question | did 


well and which one | did not. 


Furthermore, the students indicated that face-to- 
face assessment is not fair to the students who 
take the test early. Those who take the test later 
have an advantage of knowing the test topics from 
students who have been tested. In_ lab 
assessment, all the students answer the same 
questions at the same time, and there is no such 
leakage of the test questions. The students added 
that their speaking skill improves better in the 
case of lab assessment than _ face-to-face 
assessment, as the lab tests are more 
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demanding and require a lot of studying, practice 
and reviewing. Face-to-face tests are easy and do 
not require much effort in studying and reviewing. 


4.2 Instructors’ Views 


Unlike the students, 80% of the instructors 
surveyed preferred the face-to-face assessment 
for several reasons. Although face-to-face tests 
take a long time to administer to all the students, 
it is easier to prepare and score. Depending on 
the speaking course the students are enrolled in, 
each student answers 3-5 questions on a topic in 
a conversation or interview with the instructor, 
describes a picture, or gives a presentation in 
class about a topic they choose and prepare. In 
the latter case, the instructor does not prepare any 
test topics. The conversations or interviews given 
on the test are realistic. The instructor can see the 
student’s facial expressions, gestures, personality 
(shy or self-confident) and ability to extemporate. 
When the student picks a topic from the basket, 
she does not have time to think about it. But if she 
hesitates or cannot generate ideas, the instructor 
prompts her by asking questions or requesting 
clarification of certain points. It is easier to grade 
face-to-face tests using the rubrics in Table 3. But 
in Table 4, it is time-consuming to add fractions in 
the rubric, and there is a possibility of making 
mistakes while adding them up. The test tasks are 
easy for all the students. As a result, the vast 
majority pass the course and only those who were 
absent in many classes fail. 


On the other hand, the instructors mentioned 
several shortcomings. For example, the test 
questions do not cover the whole material, skills 
or tasks covered in class or in the textbook. The 
students feel nervous, and anxious during the test 
because they engage in a conversation or 
interview with the instructor or give a presentation 
in front of their classmates. Moreover, the test 
conditions for the students in face-to-face 
assessment are different. Teacher fatigue may 
affect her ability to concentrate as the test session 
lasts 5 hours or more. Performance of students 
who take the test in the morning may not be like 
that of students who take the test in the afternoon 
and who wait for their turn to be tested for a long 
time. Students who take the test later, have an 
advantage of knowing the test topics given to 
students who were tested earlier. This will give 
them an opportunity to prepare something and 
even expect which topics might be given to them, 
especially in the case of large classes when 


some students take the test the next day. Inlarge 
classes, the instructor might give the same topics 
to several students because it is not possible to 
prepare 60, 70 or even 80 difference topics. In 
face-to-face assessment, there is no chance to 
give feedback to each student as there is no time. 
The student will only see her grade without 
knowing how the grade was assigned. Some 
instructors wrote: 


Suad: Because speaking skill is difficult to assess 
and requires a lot of time, | give few, easy 
questions to finish giving and scoring it in 
shortest time possible. 


Fatima: | give few, easy questions to help the 
students pass and to avoid their 
complaints because the speaking skill is 
Oifficult. 

Ghadah: When the students fail, they will have to 
take the course over, | will see them 
again in my class the following 
semester and will have more students 
in my speaking class than when all 
students pass. That is why | give easy 
questions that the students can answer 
and hence pass. 


By contrast, 20% of the instructors prefer the lab 
method because it has numerous advantages. 
The lab test takes less time (about 60 minutes) to 
administer than face-to-face assessment which 
takes at least 5 hours. Each student can answer 
between 8-10 questions or tasks. All students 
answer the same set of questions. Unlike face- to- 
face assessment, there is no discrepancy in 
question difficulty level and amount of details 
required. All students take the lab test at the same 
time and under the same conditions. There is no 
way for any student to cheat or know the types of 
test questions. The tasks, questions and topics 
given cover a wide range of skills practiced in 
class, some of which are easy, some are difficult, 
and some are of average difficulty. The total exam 
mark is divided over 8-10 questions rather than 
one or two questions in face-to-face assessment. 
If a student cannot answer a question, she will 
lose few marks only. But in face-to-face 
assessment, she will lose the whole exam mark if 
there is one question and 50% of the mark if the 
test has 2 topics. Each student receives feedback 
on her performance as the instructor writes down 
comments for each student while she is scoring 
answers. Since the lab test has many questions 
with many details, it sorts out students according 
to their mastery of 


44 


Al-Jarf; AJESS, 16(3): 38-50, 2021; Article no.AJESS.67216 


the speaking skills. One will find students who get 
an A, B, C, D and even F, unlike face-to-face 
assessment where students talk about one easy 
topic, and most students pass with high grades 
and no failures. The scoring rubric used for 
scoring lab tests is more balanced as it allocates 
50% of the question mark to content (ideas) and 
50% to the form (vocabulary, grammar, 
pronunciation, fluency...etc). 


Instructors who use lab assessment mentioned 
some shortcomings. For example, lab 
assessment lacks real-life interaction with another 
speaker as the student talks to herself while 
recording her answers. Anyway, topics on a lab 
test do not require a conversation or interview with 
another person. Unlike face-to- face assessment 
in which the instructor assigns each student a 
mark on the spot, scoring lab tests is time 
consuming. But the instructor can grade the 
recordings at her own convenience. Time spent 
on scoring lab test is worth it because the 
instructor can write feedback, can re-check 
answers, and compare different students’ 
performance and make adjustments. Instructors 
commented: 

Majda: /t is true that scoring the student’s 
recordings is time consuming, but | grade 
at my own convenience. | am not under 
pressure as in face-to-face testing where 
| have to finish testing a large number of 
students in a day and | test students non- 
stop, which is very tiring. In the end, the 
total time spent on administering and 
scoring the test in both methods is the 
same. 


Ahlam: In lab testing, | have students’ recordings 
and | can listen to them again to double 
check that | did not overrate or underrate 
responses to any question. | can compare 
students’ responses, recheck comments, 
and modify the marks | have assigned 
using the scoring rubric. 


Afnan: Lab testing results in better mastery of the 
speaking tasks as the students answer a 
variety of questions that cover the whole 
material practiced in class and contained 
in the textbook. 


4.3 Reliability, Validity and _ Discri- 
mination Power of Both Assessment 
Methods 


Statistical analysis of the students’ speaking test 
scores showed that face-to-face assessment is 


less reliable than lab assessment. The internal 
consistency reliability coefficient is .41 for the 
face-to-face test scores compared to .71 for the 
lab test scores. Some factors that affect test 
reliability is the number of questions on a test, 
number of skills tested, the difficulty level and 
variability of the tasks covered by the test 
questions. Taking these factors into 
consideration, the lab test is more reliable. 
Another intervening variable that affects reliability 
of face-to-face tests is that the students do not 
take the test under the same conditions. The 
topics that the students talk about, are not 
identical in theme and difficulty levels and the 
times they take the test during the day (fatigue 
and alertness issues). Those who take the test 
later Know the topics other students spoke about 
earlier (leak out of test topics). 


In addition, face-to-face assessment is less valid 
due to the inclusion of a written component as part 
of a speaking test. This way instructors are testing 
the writing skill, not the speaking skill. 


Moreover, scoring responses and assigning 
marks in face-to-face testing is less reliable 
because the instructor has to make a decision 
regarding the students’ response in a short time. 
There might be variability in assigning marks to 
different students especially those who take the 
test early and those who take the test late. 


Once the students leave, the instructor cannot 
double check and compare their performance 
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with that of other students. This was confirmed in 
a study by [12] which indicated that there is 
variability in raters' scoring decisions and that 
rater's level of expertise affects scores. The time 
taken to reach a scoring decision was found to 
correlate with the reliability and accuracy of 
scores. 


An additional characteristic of a good test is that it 
should discriminate between those who have 
mastered and those who have not mastered the 
skills under study [30,31]. It should also have a 
high discrimination power, especially because 
students at COLT are going to take 6 interpreting 
courses in the subsequent college levels. In this 
respect, the lab test is more discriminating 
because the students respond to numerous 
questions that vary in task, theme, and difficulty 
level. The questions are representative of the 
skills and tasks taught in the textbook. The lab test 
sorts out students into excellent, very good, good, 
below average and those who have not mastered 
the speaking skills. By contrast, face- to-face 
testing is less discriminating because the students 
speak about one easy topic only. End of course 
grades in Table 6 show no failing students, 3.5% 
passed with a D, 13% passed with a C, 43.5% 
passed with a B, and 40% passed with an Ain the 
face-to-face assessment. On the contrary, Table 6 
shows that lab testing is more discriminating as 
32% of the students failed the course, 25% got a 
D, 18% got aC, 18.5% got a B and 10% got an A. 


Table 3. Interm test scoring rubric (face-to-face assessment method) 


Test Content & Grammar Fluency & voice Pronunciation __Toial 
Qs Comprehension control 15/75 
6 marks 3 marks 3 marks 3 marks marks 
Poor excellent Poor excellent Poor excellent Poor excellent 
Q1 12 3 4 5 6 1 2 3 1 2 3 1 2 3 
Q2 12 3 4 5 6 1 2 3 1 2 3 1 2 3 
Q3 12 3 4 5 6 1 2 3 1 2 3 1 2 3 
Q4 12 3 4 5 6 1 2 3 1 2 3 1 2 3 
Q5 12 3 4 5 6 1 2 3 1 2 3 1 2 3 
Table 4. Interm final exam scoring rubric (face-to-face assessment method) 
Test Content & Grammar Fluency & voice Pronunciation Total 
Qs Comprehension control 50 
10 marks 10 marks 10 marks 10 marks marks 
Poor excellent Poor excellent Poor excellent Poor excellent 


Ql 0% 1 1% 2 0% 1 1% 2 


0% 11% 2 0% 1 1% 2 


Q2 
Q3 
Q4 
Q5 


So OoO0O°O 


- uu a 


1% 
1% 
1% 
1% 


MO wONM PP 


[92000 


-_- ou oa et 


1% 
1% 
1% 
1% 


NM wNP 


0% 1 1% 
0% 1 1% 
0 AkdartxAJERS, 
0% 1 1% 


2 


2 0% 
16(): 3&P01 £02. 


2 


0 


0 


Ya 


Va 


1 
1 


1 


1% 2 
4; Ariel no.AJESS.67216 


1% 


1% 


2 


2 


46 


Al-Jarf; AJESS, 16(3): 38-50, 2021; Article no.AJESS.67216 


Table 5. Interm and Final Exam Scoring Rubric for Each Question (Language Lab Assessment 


Method) 
Test Ques. Idea generation (content) Grammar and Pronunciation Total 
Ques. Marks vocabulary and fluency Marks 
50% 30% 20% 100 
Poor excellent Poor excellent Poor excellent 
Q1 10 1 2 3 4 5 1 2 3 1 2 
Q2 10 1 2 3 4 5 1 2 3 1 2 
Q3 10 1 2 3 4 5 1 2 3 1 2 
Q4 10 1 2 3 4 5 1 2 3 1 2 
Q5 10 1 2 3 4 5 1 2 3 1 2 
Q6 10 1 2 3 4 5 1 2 3 1 2 
Q7 10 1 2 3 4 5 1 2 3 1 2 
Q8 10 1 2 3 4 5 1 2 3 1 2 
Q9 10 1 2 3 4 5 1 2 3 1 2 
Q10 10 1 2 3 4 5 1 2 3 1 2 
Table 6. Distribution of Students’ Letter Grades in Speaking Ill for Face-to face and Lab 
Assessment 
A+ A B+ B C+ Cc D+ D F 
Face to face 17.5% 22.5% 27.5% 16% 10% 3% 1% 2.5% 0% 
Assessment 
Lab testing 1.5% 8.5% 8.5% 10% 6% 7% 8.5% 17% 32% 
assessment 
N (Face to face) = 120 N (lab) = 69 


The preference for language lab assessment by 
the students and some instructors in the present 
study is supported by findings of a study by Huei- 
Chun [10] in which Taiwanese college students 
answered questions, described a picture, gave a 
presentation in the language lab, and responded 
to a survey on an audiotape. Huei-Chun [10] 
results showed that the students performed better 
on the speaking task of answering questions and 
exhibited more fluency and complexity. 


5. RECOMMENDATIONS AND CONCLU- 
SION 


To improve the quality of speaking tests at COLT, 
no matter what the assessment method is, the 
study recommends the following: 

1) Since the test length affects test reliability, 
face-to-face tests should require the 
students to talk about at least 3 topics on 
the Interm test and at least 5 topics on the 
final. If conducted in the language lab, 
interm tests and finals should contain twice 
as many questions as those on a face-to- 
face test. 


Since face-to-face testing is time 
consuming and the instructor has to test a 
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large number of students individually, the 
testing time will be reduced if two or more 
speaking instructors collaborate in 
administering a speaking test to 30% or 
40% of the students per assessor. In this 
case assessors should discuss’ the 
assessment method in advance and use a 
unified scoring rubric. Collaborative testing 
will also help increase the number of 
questions and topics on a speaking test and 
make the test more reliable. 

In all testing modes, test instructions must 
specify the type and number of details and 
type of task that must be taken into 
consideration while talking about a topic, 
emphasize correct sentence structure, 
pronunciation, stress, intonation, fluency, 
and so on. 

In selecting speaking test topics, the topics 
should be comparable to but not a repeat or 
identical to what the students have 
practiced in class. 

In scoring speaking tests, a scoring rubric 
must be prepared. 50% of the total test 
mark should be allocated to the content 
(ideas, organization, cohesiveness), 50% to 
the form (grammar, — pronunciation, 
articulation of phonemes, stress, and 
intonation and fluency). In scoring content 
(ideas), marks should be allocated to each 


detail. In grading grammar, pronunciation, 
intonation, stress, and fluency, holistic 
grading is used. If what the student 
has said is off point, no marks are given, 
even if her grammar and pronunciation are 
good because this indicates that she has 
poor listening comprehension and _ poor 
ability to generate ideas suitable for the 
topic. In a study with Taiwanese college 
students, [15] found that the rating method 
had a noticeable impact on how teachers 
judge students’ performance and interpret 
their scores. 

6) No matter which testing mode is used, 
studentsshould be given feedback on their 
performance. Most Taiwanese students in 
study [15] considered feedback helpful in 
improving their speaking ability. 

7) Raising speaking instructors’ awareness of 
the purpose of speaking assessment, and 
different testing modes, individual, paired, 
group face-to-face testing, lab testing is 
necessary. Training speaking instructors in 
selecting test questions, administering face- 
to-face and lab tests, and scoring them ina 
fair and reliable manner as described earlier 
is recommended. [12] indicated that training 
raters of speaking tests resulted in increased 
inter-rater correlation and agreement, and 
improved correlation and agreement with 
established reference scores. 


Finally, speaking instructors must always 
remember that they are not preparing students at 
COLT to pass the speaking courses only, but they 
are preparing translators and interprets who need 
to acquire advanced speaking skills to help them 
in the interpreting courses that they will be taking 
later in the translation program and help them 
acquire the interpreting skills that they will be using 
when they work as interpreters after graduation. 
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APPENDIX 


APPENDIX A: SAMPLE TESTS IN FACE-TO-FACE ASSESSMENT 


TEST 1: Each student answers 1 question only 


1) 


Tell about your family. How many members are there? Are you the oldest/ middle/youngest child? 
Where does your family live? Do you have any responsibilities at home? Do you have any family 
traditions/habits that you do regularly? What are they? What do you like/dislike about these 
traditions/habits and why? 

Tell about your school life. Where did you go for elementary, junior high and secondary school? 
What was the most interesting part of your school life? What was the worst part of your school 
life? What was the hardest thing to learn? What was the funniest thing tolearn? 

Tell about your collage life. Are you happy about your major? Is it like what you have expected? 
What is your most wonderful experience till now? What is your favorite course? When do you 
expect to graduate? What are your plans after graduation? 

What are your plans for your career? What do you like to be? Where do you like to work? why? 
Do you like to study for an M.A. or Ph.D. and why? Do you think that studying for five years is 
enough and why? 

Tell about yourself. What kinds of movies do you like? why? What are your favorite hobbies? 
why? What kinds of sports do you like to watch/practice yourself? why? What is your favorite 
season? why? What is your favorite food? why? 

What are you looking forward to in your social life? When do you like to get married? Do you like 
to have a big, fancy wedding or simple one? why? Who do you like to be married to? Who are 
you going to invite? When do you like to have children? What names do you like to give them? 


TEST 2: The same question is used with each situation. Each student responds to one 


situation only 


Q: What would you do and how would you react in the following situation: 


wero ye rm rN NS NS 


1 
2 
3 
4 
5 
6 
7 
8 
9 
1 


You are alone in a classroom the first day of school. Then another student comes in. 
You are a professor. A new student comes into your office to meet you. 

You are in your office. A new worker enters and seems lost and confused. 

You are in a friend's party. You see a new person standing next to you. 

You meet a friend after being apart for a long time. 

You meet a family member after being apart for a long time. 

You are ina gym. You see another student from your class. 

You are the only passenger in a bus. Another person gets on and sits next to you. 
You are in a friend's dorm room. A new person enters the room. 


0) You come to a party with a friend. The party is very crowded. You are ready to leave, but you 


cannot find your friend. 


11) You see a small child at the park. The child seems lost, and he starts to cry. You are the only 


adult around him. 


12) You just saw a car accident. The person who caused the accident drove away, but you got a 


good look at that person. 


13) Your grandmother is visiting you from another city. She went out for a walk in the park and has 


not returned yet. She went at 4:00 p.m., and it is 9:00 p.m. now. 


14) You bought a coffee maker, but when you took it home and tried to use it, it did not work. 
15) Your friend asks you to return a CD you borrowed from her several weeks ago. You cannot 


remember what you did with it. 
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16) You were at the mall. You bought an expensive watch. Then you entered another store and 
looked at some clothes. When you got home, you realized that you forgot the watch at the clothes 
store. 

17) You went to the mall. When you got home, you realized that you forgot your mobile there. It 
includes some of family, friends and your pictures. 

18) You invite your friend out for dinner to celebrate her graduation. When the waiter brings the bill, 
you realize that you forget your purse/money at home. 

19) You are having people at your home for a tea party. Your mother asks you to watch the cake. 
You forget about it, and the cake gets ruined (not good for eating). 

20) You are walking down the street. Suddenly, you see a friend in front of you. You run up to her, 
but when she turns around , you discover she is a stranger. 

21) You sit down in a chair. Suddenly, you realize that you are sitting on your friend's glasses. The 
glasses are ruined. 

22) Someone comes up to you at a meeting. You do not remember that person, but the person 
knows your name and says that the two of you met before. 


TEST 3: Each student describes 1 picture or advertisement only 


1) Look at the following picture and describe it. What do you like and do not like about it? Why? 
2) Look at the following advertisement. Is it a good or bad advertisement? Why? 


APPENDIX B: SAMPLE TESTS FOR THE LANGUAGE LAB ASSESSMENT 


TEST 1 (Speaking III): Each student answers all 10 questions 

1) Read each word out loud and give its Arabic meaning: 

2) potable, desalination, degeneration, composer, tonsillitis, chancellor, cultural invasion, senate, 
brain drain, commodities, rehabilitation, reclamation, keynote speaker, archaeology, symposium, 
indigestion, consumption, favoritism, high-intensity sounds, delegates (10 marks) 

3) What are the disadvantages of hunting animals like whales, elephants, tigers, snakes in large 
quantities? (10 marks) 

4) Give 5 characteristics of a professor that you admire. (10 marks) 

5) Give 5 causes of high blood pressure. (10 marks) 

6) Factors that make a small business successful. (10 marks) 

7) How can we improve the teaching of English in Saudi schools? (10 marks) 

8) Give 5 actions that would be taken to encourage unemployed college graduates to work as a 
cashier, taxi driver, at a gas station, and reduce foreign labor in the kingdom. (10 marks) 

9) How would you help students reduce wasted time and have more time for studying and 
recreation? (10 marks) 

10) How can we stop students from littering at the university? (10 marks) 

11) Give at least 5 advantages and disadvantages of social media? (10 marks) 
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